• De-identifying Norwegian Clinical Text using Resources from Swedish and Danish 

      Lamproudis, Anastasios; Mora, Sara; Olsen Svenning, Therese; Torsvik, Torbjørn; Chomutare, Taridzo Fred; Ngo, Phuong Dinh; Dalianis, Hercules (Journal article; Tidsskriftartikkel, 2023)
      The lack of relevant annotated datasets represents one key limitation in the application of Natural Language Processing techniques in a broad number of tasks, among them Protected Health Information (PHI) identification in Norwegian clinical text. In this work, the possibility of exploiting resources from Swedish, a very closely related language, to Norwegian is explored. The Swedish dataset is ...
    • Deidentifying a Norwegian clinical corpus - An effort to create a privacy-preserving Norwegian large clinical language model 

      Ngo, Phuong Dinh; Tejedor Hernandez, Miguel Angel; Olsen Svenning, Therese; Chomutare, Taridzo Fred; Budrionis, Andrius; Dalianis, Hercules (Journal article; Tidsskriftartikkel; Peer reviewed, 2024)
      This study discusses the methods and challenges of deidentifying and pseudonymizing Norwegian clinical text for research purposes. The results of the NorDeid tool for deidentification and pseudonymization on different types of protected health information were evaluated and discussed, as well as the extension of its functionality with regular expressions to identify specific types of sensitive ...
    • Improving Quality of ICD-10 (International Statistical Classification of Diseases, Tenth Revision) Coding Using AI: Protocol for a Crossover Randomized Controlled Trial 

      Chomutare, Taridzo Fred; Lamproudis, Anastasios; Budrionis, Andrius; Olsen Svenning, Therese; Hind, Lill Irene; Ngo, Phuong Dinh; Mikalsen, Karl Øyvind; Dalianis, Hercules (Journal article; Tidsskriftartikkel; Peer reviewed, 2024-03-12)
      Background: Computer-assisted clinical coding (CAC) tools are designed to help clinical coders assign standardized codes, such as the ICD-10 (International Statistical Classification of Diseases, Tenth Revision), to clinical texts, such as discharge summaries. Maintaining the integrity of these standardized codes is important both for the functioning of health systems and for ensuring data used ...
    • Using a large open clinical corpus for improved ICD-10 diagnosis coding 

      Lamproudis, Anastasios; Olsen Svenning, Therese; Torsvik, Torbjørn; Chomutare, Taridzo Fred; Budrionis, Andrius; Ngo, Phuong Dinh; Vakili, Thomas; Dalianis, Hercules (Journal article; Tidsskriftartikkel, 2023)
      With the recent advances in natural language processing and deep learning, the development of tools that can assist medical coders in ICD-10 diagnosis coding and increase their efficiency in coding discharges ummaries is significantly more viable than before. To that end, one important component in the development of these models is the datasets used to train them. In this study, such datasets are ...